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Abstract. We present a new approach to metric Diophantine approximation on 
manifolds based on the correspondence between approximation properties of numbers 
and orbit properties of certain flows on homogeneous spaces. This approach yields a 
new proof of a conjecture of Mahler, originally settled by V. G. Sprindzuk in 1964. 
We also prove several related hypotheses of Baker and Sprindzuk formulated in 1970s. 
The core of the proof is a theorem which generalizes and sharpens earlier results on 
non-divergence of unipotent flows on the space of lattices. 



1. Introduction 

We start by recalling several basic facts from the theory of simultaneous Dio- 
phantine approximation. For x, y G M n we let 

n 

x- y = y^ Xiyi, ||x|| = max \xi\, 

' ' Ki<n 
i=l 



n(x) = \xi\ and 11+ (x) = ] | 

i=l i=l 



n 



+ ) 



where \x\ + stands for max(|x|,l). One says that a vector y G IR n is very well 
approximate (cf. [S2]), to be abbreviated as VWA, if the following two equivalent 
conditions are satisfied: 

(i) for some e > there are infinitely many q G Z n such that 



(i-i) |q-y + p|-l|qir< 



-ng 



for some p G Z; 

(ii) for some e > there are infinitely many q G Z such that 

(1-2) \\qy + p\\ n -\q\<\q\~ £ 
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for some p G Z n . 

The equivalence of (1.1) and (1.2) follows from Khintchine's Transference Prin- 
ciple, see e.g. [C, Chapter V]. It is well known (see [C, Chapter VII]) that almost 
every y G MJ 1 is not VWA. A more difficult question arises if one considers almost 
all points y on a submanifold M of M n (in the sense of the natural measure class 
on M). In 1932 K. Mahler [Mah] conjectured that almost all points on the curve 

(1.3) M = {(x,x 2 ,...,x n ) | x G R} 

are not very well approximable. V. Sprindzuk's proof of this conjecture (see [Spl, 
Sp2]) has eventually led to the development of a new branch in approximation the- 
ory, usually referred to as "Diophantine approximation with dependent quantities" 
or "Diophantine approximation on manifolds" . According to Sprindzuk's terminol- 
ogy, a submanifold M C M, n is called extremal if almost all y G M are not VWA. 
Since 1964, extensive classes of extremal manifolds have been found by Sprindzuk 
and his followers, see [Sp3, Sp4] for a review. However, the following conjecture 
made by Sprindzuk in 1980 has remained unproved: 

Conjecture Hi [Sp4]. Let fi,...,f n be real analytic functions in x G U , U a 
domain in M, d , which together with 1 are linearly independent over ffiL Then the 
manifold M = {f(x) | x G U} is extremal. 

The case n = 2 was settled by Schmidt in 1964 [SI], and very recently V. Beres- 
nevich and V. Bernik [BeBe] proved the above result for n = 3. 

Note that the validity of the above conjecture for polynomials fi, • • . ,f n £ QM 
with deg(fi) < n follows from Sprindzuk's original proof of Mahler's Conjecture. 
However, there exists a strengthening of Conjecture Hi which has not been proved 
even for the curve (1.3). Namely, one says that y G M. n is very well multiplica- 
tively approximable (to be abbreviated as VWMA) if the following two equivalent 
conditions are satisfied: 

(i) for some e > there are infinitely many q G Z n such that 

(i.im) |q-y + p|-n + (q)<n + ( q )- £ 

for some p G Z; 

(ii) for some e > there are infinitely many q G Z such that 

(1.2M) n(gy + p).|g| < \q\- £ 

for some p G Z n . 

In other words, instead of taking the nth power of the maximum of the "differ- 
ences" \qyi +Pi\ in (1.2) one simply multiplies them, and instead of taking the nth 
power of the maximum of the numbers \qi\ in (1.1) one multiplies those of them 
which are different from zero. The equivalence of (1.1M) and (1.2M) can be shown 
by modification of the argument needed to prove the standard form of Khintchine's 
Transference Principle, see a remark in [Sp3, p. 69]. A manifold M C W 1 is said 
to be strongly extremal if almost all y G M are not VWMA. Clearly (l.i) implies 
(l.iM), i = 1, 2; therefore any strongly extremal manifold is extremal. In his book 
[B], A. Baker raised the question of confirming the strong extremality of the curve 
(1.3). Later, in his review [Sp4], Sprindzuk also stated 
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Conjecture H 2 . Any manifold M C M. n satisfying the assumptions of Conjecture 
Hi is strongly extremal. 

We remark that the validity of the above conjecture for polynomials with n < 4 
is the recent result of V. Bernik and V. Borbat [BeBo]. See also [VC] and references 
therein for other special cases. 

In this paper we present a proof of the above conjectures as well as some of 
their generalizations. More precisely, consider a <i-dimensional submanifold M = 
{f (x) | x G U} of M n , where U is an open subset of R d and f = (/i, . . . , f n ) is a 
C m imbedding of U into M. n . For / < m, say that y = f(x) is an l-nondegenerate 
point of M if the space lR n is spanned by partial derivatives of f at x of order 
up to /. We will say that y is nondegenerate if it is /-nondegenerate for some I. 
One can view this condition as an infinitesimal version of not lying in any proper 
affine hyperplane, i.e. of the linear independence of 1, /i, . . . , f n over R. Indeed, 
if the functions fi are analytic, it is easy to see that the linear independence of 
1, /i, . . . , f n over 1 in a domain U is equivalent to all points of M = f(U) being 
nondegenerate. Thus Conjecture H2 would follow from 

Theorem A. Let /i,...,/ n G C m (U), U an open subset ofR d , be such that 
a.e. point of M = {f(x) | x G U} is nondegenerate. Then M is strongly extremal. 

Note that if n = 2 and d = 1, the above assumption says that the curve 
{ / 2 (^)) I x G U), UcK, has nonzero curvature almost everywhere. This 
was exactly the assumption used by Schmidt [SI], and the strong extremality of 
such a curve is the subject of Theorem 14 from [Sp3, Chapter 2]. 

Our proof of Theorem A is based on the correspondence (cf. [Dl, Kl, K2]) 
between approximation properties of numbers y G lR n and behavior of certain 
orbits in the space of unimodular lattices in R n+1 . More precisely, one acts on the 
lattice 

(1.4) A y = (J £)r- 

by semisimple elements of the form 

n 

(1.5) g t =diag(e t ,e-\...,e- t "), t = (t u . . . , t n ), U>0,t = J2^ 

i=i 

(the latter notation will be used throughout the paper, so that whenever t and t 
appear in the same formula, t will stand for Yl7=i h)- 
Define a function 5 on the space of lattices by 

(1.6) 5(A) = f inf II v II 

veA\{0} 

(note that the ratio of 1+ log (l/<5(A)) and l+dist(A, Z n+1 ) is bounded between two 
positive constants for any right invariant Riemannian metric "dist" on the space 
of lattices). We prove in §2 that for any very well multiplicatively approximable 
y G lR n there exists 7 > and infinitely many t G 17] such that 



(1.7) 



S(g t A y ) < e~+ ; 
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in other words, if dist(<7 t A y , Z n+1 ) grows sublinearly as a function of t, then y is not 
VWMA. Thus to prove the strong extremality of M as in Theorem A it is enough 
to show that for any nondegenerate point yo = f( x o) there is a neighborhood B of 
xo in U such that for almost all y G f(B) and any 7 > 0, there are at most finitely 
many teZ" such that (1.7) holds. 

In view of Borel-Cantelli, the latter can be done by estimating the measure of 
the sets 

(1.8) E t = {x G B I <% t Af(x)) < e"^} 
for any fixed t G Z™ so that 

(1.9) J]|£ t |<oo 

tez™ 

(here and hereafter | • | stands for the Lebesgue measure). Such estimates are 
obtained in §5 by modifying proofs of earlier results on non-divergence of unipotent 
flows in the space of lattices. According to the theorem of Dani [D2] strengthening 
an earlier result of Margulis [Mar], for any c > and any lattice A in R k there 
exists e > such that for any unipotent subgroup G M} of SXfc(IR) one has 

(1.10) \{x G [0,T] I S(u x A) < e}\ < cT . 

Similar estimates are known for any polynomial map from R d to GL k (R) instead 
of x — > u x , cf. [Sh] (or [EMS] for a bigger class of maps). In §3 (see Proposition 
3.4 for a precise statement) we show that if yo = f( x o) is nondegenerate, xo has a 
neighborhood on which linear combinations of 1, /1, . . . , f n behave like polynomials 
of uniformly bounded degree. Then in §§ 4 and 5 we modify the argument of 
Margulis and Dani in order to get a quantitative relation between c and e in the 
analogue of (1.10) (see Proposition 2.3) which will guarantee convergence in (1.9). 
The last section of the paper deals with several possible extensions of the main 
result, as well as some open questions. 

2. Reduction to a statement about lattices 

Given a vector y G R n , consider a lattice A y in IR n+1 defined as in (1.4). Note 
that elements of A y are of the form 

(2.1) (q-y + p, qi, •••,?»»), 

where p G Z and q = (qi, . . . , q n ) G Z n . We are going to consider the action 
of the semigroup {gt} as in (1.5) on the space of unimodular lattices in IR n+1 , 
i.e. the homogeneous space SX n _|_i(R)/S'L n _|_i(Z). The function 6 defined in (1.6) 
will be used to "describe the structure of the space of lattices at 00" : by Mahler's 
Compactness Criterion (see [R, Corollary 10.9]) a sequence A k tends to infinity in 
SL n+1 (R)/SL n+1 (Z) iff <y(A fc ) -> as k -> 00. 

The next lemma helps one to reduce problems involving approximation proper- 
ties of y to studying the rate, calculated in terms of the function 5, with which 
orbit points {sr t A y } may go to infinity. 
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Lemma 2.1. Lets > 0, y G M n and (p, q) G Z n+1 6e suc/i that (1.1M) feo/ds. Pni 

(2.2a) r = n + (q)"^TT, 

and /or i = 1, . . . , n define ti > by 

(2.2b) |g l | + =re i v 

XTien 5(otA y ) < r. 

Proof. In view of (2.1) and (1.5), the inequality to be proved would follow from the 
inequalities 

(2.3a) e*|q-y + p\ < r 

and 

(2.3b) e~ u \qi\ < r, i= l,...,n. 

One immediately deduces (2.3b) from (2.2b) and \qi\ < \q%\ + Taking a product of 
the equalities (2.2b), one gets 

(2.4) n + (q)=rV. 
Thus (1.1M) can be written in the form 

e*|q-y + p\ < r" n n + (q)- £ = r~ n ■ r n+1 , 

by (2.2a) 

which proves (2.3a). □ 

An elementary computation shows that (2.2a) and (2.4) can be combined to 
yield r = e -7 *, where 

(2.5) 7= ^ . 

v 7 n + 1 + ne 

Using this remark, we prove 

Corollary 2.2. Assume that y G 1™ is VWMA. Then for some 7 > there are 
infinitely many t G snc/i t/ia£ (1.7) holds. 

Proof. By definition, for some £ > there are infinitely many solutions q G Z n 
of (1.1M). Therefore, by the above lemma and with 7 as in (2.5), there exists a 
sequence t( fc ) G with = ££=1 tj fc) -> +00 such that 5(o t(fe) A y ) < e" 7 *^'. 

Denote by [t( fc )] the vector consisting of integer parts of t\ k \ then clearly the ratio 
of S(g t (k)A y ) and S(g^ t (k)]A y ) is bounded from above by 

||#t( fe )#[t( lfc )]H = H#t( fe )-[t( fc )]ll - e " • 

Thus one gets infinitely many solutions t e Z" of S(gtA y ) < e n e _7t , and a slight 
variation of 7 yields infinitely many solutions of (1.7). □ 

Now we are ready to start the 

Proof of Theorem A. We are given an open subset U of R d and a smooth manifold 
M parametrized by f : U — > It is enough to prove that almost every point of 
M has a neighborhood W C M such that the set {y G | y is not VWMA} has 
full measure. By virtue of Corollary 2.2, it suffices to show that for a.e. yo G M 
there exists a neighborhood W C M of yo such that for any 7 > and a.e. y G W, 
the inequality (1.7) holds for at most finitely many t G Z". 

We now state a proposition, to be proved in §5, which will easily imply the above 
statement. 
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Proposition 2.3. Let f be a C l map from an open subset U ofR d to R n , and let 

xq G U be such that R n is spanned by partial derivatives of i at xq of order up to I. 
Then there exists a ball B C U centered in x and positive constants D and p such 
that for any ti, . . . , t n > and < e < p one has 



(2.6) 



, X 1/dl 

|{xeB|%A f(x) ) <e}\ <D\-J \B 



Indeed, by the assumptions on M, almost every yo G M is nondegenerate, hence 
has a neighborhood W = f(B) with B as in the above proposition. For 7 > and 
t G Z™ consider the set i? t defined in (1.8). Then there is a constant I) independent 
on t such that for large enough ||t|| one has \E t \ < De~ lt / dl . Therefore the series 
J2tei, n \^t\ converges, hence, by the Borel-Cantelli Lemma, for a.e. x G B the 
inequality (1.7), with y = f(x), holds for at most finitely many t G Z". Since the 
measure on M is obtained from the Lebesgue measure on U by a smooth map, this 
finishes the proof of Theorem A modulo Proposition 2.3. □ 

3. Good functions and nondegenerate points 
Let V be a subset of M d and / a continuous function on V. In what follows, we 

def 

will let II f\\ B = sup xeB |/(x)| for a subset B of V. For positive numbers C and a, 
say that / is (C, a) -good on V if for any open ball B cV one has 1 

(3.1) Ve>0 |{x G B I |/(x)| < e}\ < C- ( ] ■ \B\ . 



Cf. [EMS, Corollary 2.10] for a similar condition. The properties listed below 
follow immediately from the definition. 

Lemma 3.1. Let ^Cl d and C, a > be given. 

(a) / is (C, a)-good on V <^ so is \f\; 

(b) / is (C, a) -good on V =>- so is Xf V A G K; 

(c) fi, i e J, are (C,a)-good on V =3- so is sup ieJ 

We now consider what can be called a model example of good functions. 

Proposition 3.2 (cf. [DM, Lemma 4.1]). For any k G N, any polynomial f G M[x] 
of degree not greater than k is (2k(k + l) 1 '*, l/k)-good on R. 

Proof. Fix an open interval B C R, a polynomial / e R[i] of degree not exceeding 
and an e > 0. We need to show that 

(3.2) |{* G B I |/(x)| < e}\ < 2k(k + I) 1 /" ^ \B\ . 

Denote \B\ by b and - \{x G S | < e}\ by a. Then there exist xi, . . . , Xk+i G 

B such that < £, 1 < z < k + 1, and |x« — > ab/2k, 1 < z < j < k + 1. 



Here we adopt the convention i = 00, so that (3.1) holds if /|s = 0. 
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Using Lagrange's interpolation formula, one can write down the exact expression 
for/: 



fc+l V\ k+1 (r-r) 

(3.3a) = ' . 

i=l Ylj=\,j^i\ x i x j) 

and conclude that 

(3.3b) mB < {k+1)E -Jf_, 

which immediately implies (3.2). □ 

The goal of this section is to prove that if the point yo = f( x o) of the manifold 
M = f(U) C M. n , U C is nondegenerate, then there exist a neighborhood V of 
x and positive constants C and a such that all linear combinations of 1, fi, . . . , f n 
are (C, a)-good on V. Our argument is based on the following standard but quite 
technical lemma. For i = 1, . . . , d, we let di be the operator of partial differentiation 
with respect to Xi, and for a multiindex (3 = (ii,...,id), ij G Z + , we let = 
Z)j =1 ^ and 9/3 = d} 1 o • • • o d l d d . 

Lemma 3.3. Let V be an open subset ofW 1 , and let f E C k (V) be such that for 
some constants Ai, > one has 

(3.4<) \\d f3 f\\ v <A 1 V (3 with \(3\ < k , 

and 

(3.4>) \d}f(x)\>A 2 Vxe V, i = l,...,d. 

T/ien /or any d- dimensional cube B = Bi x ■ ■ ■ x C V , where Bi are open 
intervals of the same length, and for any e > one /ias 

, x l/difc 

(3.5) |{x G B | |/(x)| < e}| < dC kM f^-j \B\ , 

with 

i/k 



C k , Al , Aa = k(k + 1) (^-(k + l)(2k k + 1)^ 



Proof. For £?, / as above and £ > 0, we will denote by 6 the sidelength of B and by 
s the supremum of |/| over B. First consider the case d = 1. Take £ > 0; since, by 
(3.4>), the /cth derivative of / does not vanish on £?, the set {x G B | |/(x)| < e} 
consists of at most fc + l intervals. Let J be the maximal of those, then 

(3-6) \{xeB \ |/0r)|<e}|<(fc + l)|/|, 

so it suffices to estimate |/| from above. Denote by r the length of I and also let 
(3.7) T = k\s/b k . 



The argument will be based on the following two inequalities: 
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Sublemma 3.3.1. (a) r < ipi(r) \ -j b, where <pi(r) = k ( — - — j T X l k ; 

reV/f J^(k + l^ 1/k 

(b) r < (p 2 (r) [-) b, where ip 2 (r) = k 2 I ^_ ^ 

Proof. Divide I into k equal parts by points x±, . . . , Xk+i, and let P be the Lagrange 
polynomial of degree k formed by using values of / at these points, i.e. given by 
the expression in the right hand side of (3.3a). Then there exists x E I such that 
p( k \x) = f {k \x), hence, by (3.4>), |p( fc )(x)| > A 2 . On the other hand, one can 

ek\ 

differentiate the right hand side of (3.3a) k times to get \P^ k \x)\ < (k + 1) ^ ~jj^k ' 
Combining the last two inequalities, one obtains 

(3.B) r<_ k (^f, k (^f & \ 

which reduces to the inequality (a) by the substitution (3.7). 

Next, let Q be the Taylor polynomial of / at x\ of degree k — 1. By Taylor's 
formula, 

ll/-QII,<ll/ (fc) ll4 ^ ^ ^k k (k + i)e, 

k\ by (3.4<) kl by (3.8) M 

therefore < 2^-k k (k + l)e. 

A 2 

We now apply Lagrange's formula to reconstruct Q on B by its values at xi, . . . , Xk- 
Similarly to (3.3b), we get 

(3.9) HQIU < k ■ 2^k k (k + < ^/ k (k + ^ ■ 

Finally, the difference between / and Q on B is, again by (3.4<), bounded from 
above by A\b k jk\, so from (3.9) one deduces that 



which is equivalent to the inequality (b) modulo (3.7). □ 

To use the above estimates, note that cpi (resp. ip 2 ) is an increasing (resp. de- 

creasing) function, therefore one has r < </?i(to) y-J 6, where To is the root of 

<Pi(t) = <P2(t). An elementary computation yields tq = Ai(2k k + 1), and the 
validity of (3.5) for d = 1 follows immediately from (3.6). 

Next we argue by induction on d, assuming that (3.5) is established for all the 
lower values of d. We represent x G R d as (xi,x') and let B' be the product of 

the intervals B 2 , . . . , B d . Denote by a the value |{x G B | |/(x)| < s}\. Choose 
a positive number A < cr, and let 

fli(A) = |xi G Bx -L |{x' G B' | /(*i,x')| < e}| > a} . 
The induction step will be based on the following 



Sublemma 3.3.2. -|£?i(A)| > a - X. 
Proof. Write 

{xG5| |/(x)| < e} C B 1 (X) x5'u{xe (B l \ B 1 (X)) x B' | |/(x)| < e} , 

and put jU = -|£?i(A)|. Then, by Fubini, b d a = |{x G B | |/(x)| < e}| is not greater 
than fyu • b d ~ x + 6(1 - //) • 6 d_1 A < 6 d (A + //), hence the claim. □ 

For xi G Si, denote by f Xl the function on B' given by / Xl (x / ) = f /(xi,x'). 
Clearly such functions satisfy assumptions (3.4) on derivatives with d — 1 in place 
of d, therefore, by the induction assumption, for any x\ G -Bi(A) one has 

1 / xl/(d-l)fc 

or 

On the other hand, for any x' G -B' the functions x\ — > /(xi,x') also satisfy the 
assumptions on derivatives (with 1 in place of d), hence are (C fe]Alij42 , l//c)-good on 
Si. By Lemma 3.1(c), the function x\ — > ||/ X i||b' is also (C fe]Alij42 , l/A;)-good on 
Bi, thus, by the above sublemma, 



<x-A< i|{xieBi|||/ xl |U' < ((d-l)CV„. 1 . J , a /A) < ' i 1)fc £ } 



or equivalent ly, 

(3-10) X d ~\a - A) < C d W2 {d - l)^ 1 . 

The function in the left hand side of (3.10) attains its maximum, a d (d — l) d_1 /d d , 
when A = o~(d — l)/d; substituting it into (3.10), one easily obtains (3.5). □ 

We now come to the main result of the section. 

Proposition 3.4. Let f = (fi, . . . , f n ) be a C l map from an open subset U ofR d 
to M. n , and let x G U be such that MJ 1 is spanned by partial derivatives of f at x 
of order up to I. Then there exists a neighborhood V C U o/x and positive C such 
that any linear combination of 1, fi, . . . , f n is (C, l/dl)-good on V . 

Proof. Take / = cq + Y^7=i c ifi'-> * n y i ew °f Lemma 3.1(b), one can without loss 
of generality assume that the norm of c = (cq, . . . , c n ) is equal to 1. From the 
nondegeneracy assumption it follows that there exists a constant C\ > such that 
for any c with ||c|| = 1 one can find a multiindex (3 with \(3\ = k < I and 

n 

\J2 c i d M^o)\ = \dpf^o)\>Ci. 
i=i 



By an appropriate rotation of the coordinate system around x one can guarantee 
that |<9f/(xo)| > C2 for all i = l,...,d and some positive C2 independent of c. 
Then one uses the continuity of the derivatives of /1 , . . . , f n to choose a neigh- 
borhood V' C U of xo and positive Ai, A2 (again independently of c) such that 
the inequalities (3.4) hold. Now let V be a smaller neighborhood of x such that 
whenever a ball B lies in V, any cube B circumscribed around B is contained in 
V. Then for any e > one has 

\{xeB\\f(x)\<e}\<\{xeB\\f(x)\<e} 

e \ x l dk 2 d ( e x 1 ^ dk 



(by Lemma 3.3) < dC k , Al , A2 ( ) \B\ < —dC k , Al , A2 [ ) \B\ 



(here va stands for the volume of the unit ball in IR d ), which implies that / is 
{^dC t , AltA2 , l/dz)-good on V. □ 

4. Maps of posets into spaces of good functions 

In this section we will work with mappings of partially ordered sets (posets) into 
spaces of functions on balls in M d . Given a mapping from a poset to the space of 
functions on a ball B, we will mark certain points of B (see the definition below), 
and prove an upper estimate (Theorem 4.1) for the measure of the set of unmarked 
points. Then in §5 we will use this estimate to generalize and strengthen results on 
non-divergence of unipotent flows on spaces of lattices obtained in [Mar, D2]. 

In what follows, _B(x, r), where x G M. d and r > 0, will stand for the open ball 
of radius r centered in x. For a poset S, we will denote by l(S) the length of S 
(i.e. the number of elements in a maximal linearly ordered subset of S). If T is a 
subset of S, we let S(T) be the poset of elements of S \ T comparable with any 
element of T. Note that one always has 

(4.1) l(S(T))<l(S)-l(T). 

For d G N, k G Z + and C, a, p > 0, define A(d, k, C, a, p) to be the set of triples 
(S, (p, B) where S is a poset, B = S(xo, ro), where xo G M d and ro > 0, and <p is 

a mapping from S to the space of continuous functions on B = f B(xo : 3 h ro) (this 
mapping will be denoted by s — > -i/> s ) such that the following holds: 

(AO) Z(S) < fc; 

(Al) Vs G 5\ ^ s is (C, a)-good on B; 

(A2) VsgS, ||V s |U>p; 

(A3)VxGS, #{s G 5 I |V> a (x)| < p} < 00. 

Then, given (5 1 , S) G «4(<i, fc, C, a, p) and e > 0, say that a point z 6 B is 
(s, S, ip)-marked if there exists a linearly ordered subset S z of 5 such that 

(Ml) e< |^ s (z)| <p VsGE z ; 
(M2) |^(z)| >p VsG5(E z ). 

We will denote by $(e, 5, <p, B) the set of all the (e, S, <p)-marked points z E B. 
We will also need to use 

10 



Besicovitch's Covering Theorem (see [Mat, Theorem 2.7]). There is an integer 
Nd depending only on d with the following property: let A be a bounded subset 
of M d and let B be a family of nonempty open balls in M, d such that each x E 
A is the center of some ball of B; then there exists a finite or countable subfamily 
{Bi} of B with 1a < J2i^-Bi < Nd (i.e. A C \J i Bi an d ^e multiplicity of that 
subcovering is at most Nd). 

The goal of the section is to prove the following 

Theorem 4.1 (cf. [Mar, Main Lemma] or [D2, Proposition 2.7]). Let d E N, k E Z + 

and C, a, p > be given. Then for all (S, <p, B) E A(d, k, C, a, p) and e > one has 

| | <kC(3 d N d ) k (^j \B\. 

The proof will be built up from Lemmas 4.2-4.6 below. In these lemmas, d, k, 
C, a and p will be as in the above theorem, and (S, cp, B) E A(d, k, C, a, p) will be 
fixed. We also define 

H{x)^{sES\ |^ s (x)|<p} 
for any xGB (this is a finite subset of S in view of (A3)), and let 

£ = {xGB #(x) ^0} = {xeB|3seS with |^ s (x)| < p} . 

Lemma 4.2. j4w/ point of B which does not belong to E is (e, S, cp) -marked for 
any positive e. In other words, B \ E C S, <p>, B), and therefore 

B \ $(e, S,ip,B)cE\ $(e, 5, v?, S) • 

Proof. Take z e -B \ E; by the definition of E, one has |'i/ ; s( z )| > P f° r a U s E S. 
Then one can take E z to be the empty set and check that (Ml) and (M2) are 
satisfied. □ 

The next four lemmas deal with some properties of the set E. Note that if 
1{S) = (which means S = 0), one has H(x) = for all x£B, therefore E = 0. 
Thus in the argument below we will tacitly assume k to be not less than 1. 

Take ~x.EE and s E H(x), and define 

r s ,x = sup{0 <r <2r Q \ HV^IU^) < p} . 

def 

It follows from the continuity of functions ip s that r S)X > 0. We also let S S)X = 
#(x, r fl , x ). 

Lemma 4.3. For any x. E E and s E H(x), one has \\iP s \\b S;X > P- 

Proof. By the definition of B s x , one has either r SjX = 2r (then B s x D S and the 
claim follows from (A2)), or ||V's|U( x ,r) > P Vr > r SjX , in which case we are done 
by the continuity of ip 8 . □ 

For any x E E choose an element s x of H(x) such that r SxjX > r S)X for all 
s E H(x) (this can be done since H(x) is finite). For brevity we will denote r SxjX by 
r x and S SxjX by B x . Note that in fact B x = U sG #(x) 5 s,x and r x = max seH(x) r S)X ; 
in particular, r x < 2ro for any x E E. 

11 



Lemma 4.4. For any x E E and s E S, one has \\ip s \\ Bx > P- 

Proof. Assume that ||V's||b x < P- Then one necessarily has IV'sWI < Pi therefore 
s E H(x) and B SjX is defined. But B SjX is contained in B x , so the claim follows 
from Lemma 4.3. □ 

For any x E E we let S x = f ^({Sx}). The induction procedure of the proof of 
Theorem 4.1 will be based on the following 

Lemma 4.5. For any x E E, one has 

(S x , <p\ Sx , B x ) E A{d, k-l,C, a, p) . 

Proof. The properties (AO) and (A2) for (S x , y?|s x , B x ) follow from (4.1) and Lemma 
4.4 respectively. To prove (Al) and (A3) it suffices to notice that 

B x d =i f B(x, 3 fc -V x ) C B(xo, 3 fc "V x + r ) 

C S(xo, (2 • 3*- 1 + l)r ) C S(xo, 3 fc r ) = S . □ 

The next lemma gives one a way to prove that a point z e -B is (e, S, </?)-marked 
provided it is (e,S x , <p\s x ) -marked for some x E E. 

Lemma 4.6. For e > and x E E, let z E B n 3>(e, S x ,<p\s x , B x ) be such that 
|V , s x ( z )| > e - Then z e <£>(£, S, <p, B). Equivalently, 

BC](B X \ $( £ , S, cp, B)) CB X \ 5 X , ^| 5x , 5x) U{zeB x |^ Sx (z)| < e} . 

Proof. By the definition of S x , <f\s x , B x ), there exists a linearly ordered subset 
£ xz of S x such that 

(4.2) e<\^ s {z)\<p VseS x , z 
and 

(4.3) |^(z)l>P VseS x (E x , z ). 

Put E z d = E XjZ U {s x }. Then S(E B ) = S x (£ x , z ); therefore (M2) immediately 
follows from (4.3), and, in view of (4.2), it remains to check (Ml) for s = s x . The 
latter is straightforward: |V> Sx ( z )| is not less than e by the assumption and is not 
greater than p since z E B x . □ 

We are now ready to give a 

Proof of Theorem We proceed by induction on k. First take (S,(p,B) E 
A(d, 0, C, a, p); since the poset S is empty, all points of B are (e, S, </?)-marked 
for any e > 0, which means that in the case k = the claim is trivial. Now take 
j > 1 and suppose that the theorem is proved for k = j — 1; put k = j and take 
(5, </?, £?) G -4(d, k, C, a, p) and a positive e. 
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In view of Lemma 4.2, it suffices to estimate the measure of E \ 5, 9?, £?). 
On the other hand, from Lemma 4.6 one deduces that for any xe£, the measure 
of the intersection of _B X \ S, <p, B) with S is not greater than 

|S x \$( £ ,5 x ,^| Sx ,S x )| + |{zgS x I |V> flx (z)| <e}| . 

The first summand is not greater than (/c — l)C(3 d iVrf) fc 1 ^-^ |£? x | by Lemma 

4.5 and the induction assumption. The second one, in view of ij) Sx being (C, a)-good 
on B D £? x , is not greater than 

c( „. £ „ Y\BJ < cf-V|BJ. 



IIV , s x ||s x / Lemma 4.4 



Consequently, one gets 
(4.4) 



J Bn( J B x \$( £ ,5,^, J B))|<c((A;-l)(3 d iV d ) fc 1 + l) Q |S X | 



Now consider the covering {S x | x e i£} of i? and, using Besicovitch's Covering 
Theorem, choose a subcovering {-B^} of multiplicity at most N d . Then 

(4.5) Y^\Bi\ <iV d |U^| < JV d |S(xo,3r )| < N d 3 d \B\ . 

i 

The sets Bn(Bj\ $(e, S, cp, B)) cover E \ $(e, 5, <p, -B), therefore 
(£? \ <&(e, 5, S) I < £ I B n (Bi \ $(e, 5, v?, S)) I 



(by (4.4)) <kC(3 d N d ) k £ |B. 

(by (4.5)) < fcC^)*" 1 Q° tf d 3 d |5| < kC(3 d N d ) h (jf \B 
and the theorem is proven. □ 

5. Quantitative non-divergence in the space of lattices 

We now apply Theorem 4.1 to the poset of discrete subgroups of Z fc , k G N. We 
fix a basis ei, . . . , ejt of M fc , and for / = {ii, . . . , ij} C {1, . . . , k}, %\ < 12 < • ■ ■ < ij, 

we let e 7 = f A • • • A C /\ J (1R A: ), with the convention e = 1. We extend 
the norm || ■ || from M fc to the exterior algebra f\(M. k ) by || J^/cfi w / e /ll = 

For a discrete subgroup T of lR fc , we denote by T R the minimal linear subspace 
of R fc containing V. Let j = dim(rm); say that w e /\ J (lR fc ) represents T if 

r 1 if i = 

w = < 

[ Vi A • • • A Vj if j > and vi , . . . , Vj is a basis of V . 
Clearly the element representing T is defined up to a sign. Therefore it makes sense 
to define the norm of V by ||r|| = ||w||, where w represents V. Note that if V is 



a lattice, the ratio of ||r|| and the volume of the quotient space Mr/T is uniformly 
bounded between two positive constants. 
We need the following simple lemma: 
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Lemma 5.1. Let V be a discrete subgroup of R k , v G R fc \ Tr, and let Actf 
be a discrete subgroup containing both V and v such that Ar = Tr + Rv. Then 
< k\\T\\ • ||v||, or, equivalently, ||v|| > ||A||/fc||r||. 



Proof. Let w represent F; since ||A|| is not greater than the norm of wAv, it suffices 
to show that ||wAv|| < fe||w|| ■ ||v||. Write w = X)ic{i it} w i e i an d v = z^£=i v i e ii 
then 

k 

i y A > vseAl < k- max \wjvA 

/C{l,...,fc} i=l Jc{l~.,fe} 

< • max ItUjI • max li^l = A; II w II ■ llvll . □ 

JC{l,...,fe} l<i<fc 

Let A be a discrete subgroup of R fc . We say that a subgroup T of A is primitive 
(in A) if T = F-r fl A, and denote by C(A) the set of all nonzero primitive subgroups 
of A. The inclusion relation makes £(A) a poset, its length being equal to the 
dimension of Ar. 

Theorem 5.2. Let d, k G N, C, a > 0, < p < 1/k, and let a ball B = B(xq, r ) C 
R d and a map h : B — > GXfc(R) 6e given, where B stands for -B(x , 3 fc ro) . -For any 

T G C(Z k ), denote by ip r the function V'r(x) = f ||^( x )r||, x G B. Assume that for 
any F G C{Z k ), 

(i) ^ r is (C,a)-good on B; 

(ii) HV'rlU > p. 

Then for any positive e < p one has 

(5.1) |{xG B | 5(h(x)Z k ) < e}\ < kC(3 d N d ) k Q |B| . 

Proof. We let £ = C(Z k ) and denote by </? the map T — > -i/V It is easy to verify 
that (S,(p,B) G A(d, k, C, a, p). Indeed, the property (AO) is clear, (Al) is given 
by (i), (A2) by (ii), and (A3) follows from the discreteness of /\{Z k ) in /\(R k ). 

In view of Theorem 4.1, it remains to prove that a point xGB with o"(/i(x)Z fc ) < 
e can not be (e, S, </?)-marked. In other words, 

(5.2) $(e, S, ip, B) C {x G B | o"(/i(x)Z fc ) > e} . 

Take an (e, S, v?)-marked point x G B, and let {0} = r C I\ C • • • C F t = Z k 
be all the elements of S x U {{0}, Z fc }. Take any v G Z k \ {0}. Then there exists i, 
1 < i < I, such that v G \ IVi. Denote (IVi + Rv) n Z k by A. Clearly A is a 
primitive subgroup of Z k contained in 1^, therefore A G E x U S^Ex). Now one can 
use properties (Ml) and (M2) to deduce that 

IV>a(x)| = ||/i(x)A|| > min(e,p) = e, 
and then apply Lemma 5.1 to conclude that 

i.w n ii ||/i(x)A|| 

This shows (5.2) and completes the proof of the theorem. □ 

As was mentioned in the introduction, our method of proof is based, with some 
technical changes, on the argument from [Mar] and its modification in [D2]. As an 
illustration, let us show how one can use the above theorem to get a quantitative 
strengthening of Theorem 2.1 from [D2]. 
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Theorem 5.3. For any lattice A in R fc there exists a constant p = p(A) > such 
that for any one-parameter unipotent subgroup {u x } xe K. of SLk(M), for any T > 
and any e < p, one has 

(5.3) \{0<x<T\d(u x A) <e}\<2k s 6 h (k 2 + l) 1/k2 (^) T. 

Proof. Write A in the form gZ fc with g G GLfc(R), and denote by h the func- 
tion h(x) = u x g. For any Y G £(Z fc ) with basis vi, . . . , v,-, the coordinates of 
h(x)(vi A ■■■ A Vj) will be polynomials in x of degree not exceeding k 2 , hence 

good on R by Proposition 3.2. In view of Lemma 3.1(c) and 

def 

the definition of the norm of a lattice, the functions ip r {x) = ||/i(x)r|| will also be 
(2k 2 (k 2 + l) 1 ^ 2 , l/£; 2 )-good on R. 

Now let p = min (l/k, infre£(z fe ) Wd^ll), positive by the discreteness of A in R fc . 
Then ip r (0) > p for any V G £(Z fc ), therefore HV'rl^cT) > P by the continuity of h. 

We see now that with the the substitutions B = (0,T), C = 2k 2 (k 2 + l) 1/k \ 
a = l/k 2 and d = 1 (note that it is an elementary fact that N\ = 2), assumptions (i) 
and (ii) of Theorem 5.2 are satisfied, and one immediately gets (5.3) from (5.1). □ 

We now derive another corollary from Theorem 5.2 which will immediately imply 

def 

Proposition 2.3. In what follows, we put k = n + 1 and for y G R n define u y = 
1 y T \ 

q j J, so that the lattice A y (see (1.4)) is given by u y Z n+1 . We also let 
{eo, ei, . . . , e n } be the standard basis of R n+1 . 

Theorem 5.4. For n G N, put k = n + 1 and let d, C, a, p and B = 5(xo,ro) 

be as in Theorem 5.2. Also let f = (fi, ■ ■ ■ , f n ) be a continuous map from B = f 
S(x ,3 n+1 r) to R n such that 

(i) for any c = (c , c 1 ,..., c n ) G R n+1 , c + J2i=i c ifi is ct)-good on B; 

(ii) for any c G R n+1 with ||c|| > 1, ||c + Yn=\ Cifi\\ B > P- 

Fix nonnegative numbers t\, . . . , t n , and for x G B let 

(5.4) h{y) = f t «f(x) , 

where g t is given by (1.5). TTien /or any positive e < p one has 

(5.5) | { xG5|^(x)Z^)< £ }|<(n + l)C(3^r 1 g) |S| . 

Proof. In view of Theorem 5.2, it suffices to show that (i) and (ii) above will remain 
true with Co + Ym=i c ifi replaced with ip r = f ||/i(-)r||, where V is any element of 
£(Z n+1 ). Take w = J2ic{o n} w i e i representing T. To see how the coordinates 
of w change under the action of /i(x) of the form (5.4), first note that the action 
of Wf( x ) leaves eo invariant and sends to + /i(x)eo, i = 1, . . . , n. Therefore 



ej if G I 

e i + Y.iei ±/i( x ) e iu{o}^{o otherwise, 
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which shows that 



Mf(x)W = Wie ' + ( Wl + ± ^u{i}^{0}/i(x) ) e 7 . 

og/ oei V J 

Applying g t to both sides of the above formula, one gets /i(x)w = J2ic{o n} ^/( x ) e /> 
where 



M*0 



e ^»6J ti w J if 7 

e Ei t iU (^ Wl + J2 i ^ I ±w IU{x} ^ m f i (x.)) otherwise. 



It follows that all the coordinates ftj(x) of /i(x)w are of the form c +^™ =1 c it fj(x) 
for some c G M n+1 . Thus, by virtue of assumption (i) of the Theorem, they are 
(C, a)-good on B. Then Lemma 3.1(c) applies and one can conclude that ip r = 
su P/c{o,...,n} * s (C,a)-good on B. Furthermore, since the coordinates w 7 of 
w are integers and at least one of them is nonzero, one can conclude that for 
some / 9 0, the function /i 7 (x) is of the form cq + Yl7=i c */* with ||c|| > 1. Hence 
ll^/IU > P for this I. Therefore HVvlU ^ Pi which is all one needs to apply Theorem 
5.2. □ 

It is now easy to write down the 

Proof of Proposition 2.3. Take U C R d , f : U -> M n , x G U and / as in the 
statement of the proposition. Using Proposition 3.4, find a neighborhood V C U 
of xq and C > such that any linear combination of 1, /\, . . . , f n is (C, l/<i/)-good 
on V. Choose a ball S = S(xo,r) contained in U and let B = -B(xq, 3 _ ( n+1 )f). 
Then condition (i) of Theorem 5.4 is satisfied with a = 1/dl, while the existence of 
positive p satisfying (ii) follows from the linear independence of 1, /i, . . . , f n on B 
over R. The validity of (2.6), with D = (n + l)C(3 d iV d ) n+1 , is now an immediate 
consequence of (5.5). □ 

6. Concluding remarks and generalizations 

6.1. Observe that a notion of (C, a)-good functions can be defined for any metric 
space X with a Borel measure fi in place of (M d , | ■ |). Moreover, the argument of 
§4 works for arbitrary (X, fx) provided two additional conditions are satisfied: 

• Besicovitch's Covering Theorem holds, the constant N d being replaced by 

some positive number N x ; 

, , , p(B(x,3r)) 
. (cf. (4.5)) sup ^ 1 < oo. 

xex,r>o p,{B(x,r)) 

These conditions, in particular, are satisfied for X = Q p and \x a Haar measure 
on Q p . This way one can attempt to apply the methods of the present paper to 
describe extremal and strongly extremal manifolds in p-adic spaces, generalizing 
the results from [Sp2, Chapter 2]. This work is currently in progress. 

6.2. One may ask whether it is possible to develop a similar proof starting from the 
inequality (1.2M) instead of (1.1M). The answer is yes, although the proof turns 
out to be slightly more complicated. The recipe is simple: one should consider 

lattices ^\ Z n+1 instead of (1.4) and act on them by elements of the form 

diag(e* 1 , . . . , e*~, e~ l ) instead of (1.5). 
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More generally, one can unify these two approaches by saying that a matrix 
Y G Mat m)Tl (R), interpreted as a system of m linear forms in n variables, is VWA 
(resp. VWMA) if for some e > there are infinitely many q G Z n such that 

(6-1) ||yq + p|p.||q|P < ||q||- ne , 

or, respectively, 

(6.im) n(y q + P ) • n + ( q ) < n + ( q )- £ , 

for some p G Z m . Then one faces a problem of describing (strongly) extremal 
submanifolds of Mat m;n (lR) 2 . It turns out that one can apply Theorem 5.2 to get 
a general result of which Theorem A is a special case. This is going to be a topic 
of a forthcoming paper. 

6.3. It is instructive to compare Theorem A with results recently obtained by 
M. Dodson, B. Rynne and J. Vickers. Following [DRV2], say that the manifold M 
satisfies condition Kl at y G M if for any v G T y M ± , at least two of the principal 
curvatures of M at y with respect to v (see [DRV1] for further details) are nonzero 
and have the same sign. It is proved in [DRV1, DRV2] that a C 3 manifold M 
is extremal provided Kl holds for almost all y G M. Now say that M satisfies 
condition KO at y G M if for any v G TyM- 1 , at least one principal curvature of 
M at y with respect to v is nonzero. Then from Theorem A it immediately follows 
that a C 2 submanifold M of lR n is strongly extremal whenever condition KO holds 
for almost every its point. Indeed, using the analytic formulation of curvature 
conditions as in [DRV1], it is easy to see that M satisfies condition KO at y if and 
only if y is a 2-nondegenerate point of M. 

On the other hand, Dodson, Rynne and Vickers were able to use stronger cur- 
vature conditions to derive several Khintchine-type theorems and asymptotic for- 
mulae, see [DRV2, DRV3]. It would be interesting to know whether it is possible 
to obtain similar results assuming only the mild condition KO. In particular, it 
is proven in [DRV2] that if Kl holds for almost all y G M and ip : N — > K + is a 
decreasing function such that 

(6.2) V^M<oo, 

then for a.e. y G M the inequality 

(6-3) |q-y+pH|qir<^(l|qir) 

has at most finitely many solutions q G Z n , p G Z. On the other hand, one can 
modify Lemma 2.1 in the spirit of [Kl, Theorem 8.3], and then apply Proposition 
2.3 with t = (t/n, . . . ,t/n) to obtain the following 

Theorem B. Let /i,...,/ n G C l (U), U an open subset of R d , be such that 
a.e. point of M = {f(x) | x G U} is l-nondegenerate, and let ip : N — > K + be a 
decreasing function such that 

(6.4) f^*> 1/d "" +1) <oo. 

q = l Q 



2 See [Kol, Ko2] where similar objects are called "systems of (strongly) jointly extremal manifolds" . 
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Then for a.e. y G M the inequality (6.3) has at most finitely many solutions. 

The details will appear elsewhere; whether it is possible to replace (6.4) by (6.2) 
in the above theorem is an open question. We remark that for M of the form (1.3) 
and ip satisfying (6.2), the conclusion of Theorem B was conjectured by A. Baker 
in 1966 and proved by V. Bernik [Be] in 1984. 
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